Popular Summary 


Chemical Data Assimilation: A Case Study of Solar Occupation Data From the Atlas 1 Mission of the 

Atmospheric Trace Molecule Spectroscopy Experiment 


A key advantage of using data assimilation is the propagation of information from data-rich regions to data- 
poor regions, which is particularly relevant to the use of solar occupation data such as from ATMOS. For 
the first time an in depth uncertainty analyses is included in a photochemical model-data intercomparison 
including observation, representativeness, and theoretical uncertainty. Chemical data assimilation of solar 
occupation measurements can be used to reconstruct full diurnal cycles and to evaluate their chemical self- 
consistency. This paper considers as an example the measurements made by the Atmospheric Trace 
Molecule Spectroscopy Experiment (ATMOS) instrument Atlas- 1 during March 1992 for a vertical profile in 
flow tracking co-ordinates at an equivalent PV latitude of 38S. ATMOS was chosen because it 
simultaneously observes several species. This equivalent PV latitude was chosen as it was where ATMOS 
observed the atmosphere's composition over the largest range of altitudes. A single vertical profile was used 
so that the detailed diurnal information that assimilation utilizes could be highlighted. There is generally 
good self-consistency between the ATMOS Atlas-1 observations and photochemical theory. 
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Abstract. A key advantage of using data assimilation is the propagation of informa- 
tion from data-rich regions to data-poor regions, which is particularly relevant to the use 
of solar occultation data such as from ATMOS. For the first time an in depth uncer- 
tainty analyses is included in a photochemical model-data intercomparison including ob- 
servation, representativeness, and theoretical uncertainty. Chemical data assimilation of 
solar occultation measurements can be used to reconstruct full diurnal cycles and to eval- 
uate their chemical self-consistency. This paper considers as an example the measure- 
ments made by the Atmospheric Trace Molecule Spectroscopy Experiment (ATMOS) in- 
strument Atlas - 1 during March 1992 for a vertical profile in flow tracking co-ordinates 
at an equivalent PV latitude of 38° S. ATMOS was chosen because it simultaneously ob- 
serves several species. This equivalent PV latitude was chosen as it was where ATMOS 
observed the atmosphere’s composition over the largest range of altitudes. A single ver- 
tical profile was used so that the detailed diurnal information that assimilation utilizes 
could be highlighted. There is generally good self-consistency between the ATMOS Atlas- 
1 observations of 0 3 , NO, N0 2 , N 2 0 5 , HN0 3 , H0 2 N0 2 , HCN, C10N0 2 , HC1, H 2 0, CO, 
C0 2 , CH 4 , and N 2 0, and between the observations and photochemical theory. 


1. Introduction 

For more than two decades vertical profile measurements 
of trace gases have been made by solar occultation (e.g. from 
the Stratospheric Aerosol and Gas Experiment (SAGE), the 
Halogen Occultation Experiment (HALOE), Polar Ozone 
and Aerosol Measurement (POAM), the Atmospheric Trace 
Molecule Spectroscopy Experiment (ATMOS), and the Im- 
proved Limb Atmospheric Sounder (ILAS)). The observa- 
tions are self-calibrating and are therefore useful for the 
analysis of temporal trends. However, by definition the use 
of solar occultation limits the measurement opportunities 
to satellite sunrise and sunset. It is therefore useful to use 
chemical data assimilation cast in flow tracking co-ordinates 
to reconstruct full diurnal cycles and to check the chemi- 
cal self-consistency of the solar occultation measurements. 
The use of flow-tracking co-ordinates allows a more com- 
plete global view to be obtained from the sparse latitudinal 
coverage of the solar occultation measurements. 
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2. Why Use Assimilation? 

The question could be asked, £ How does using chemi- 
cal data assimilation differ from using reverse domain filling 
(RDF) isentropic trajectories where ozone depletion is com- 
puted along trajectories using aircraft and satellite data or 
other 2D and 3D photochemical models?’ 

Chemical data assimilation combines the observational 
information available from measurements with the theoret- 
ical information encapsulated into a deterministic model of 
atmospheric chemistry, together with the associated uncer- 
tainties of each. Specifically assimilation uses: 

• Constituent observations (in this case observations of 
14 constituents simultaneously). 

• Observational uncertainty. 

• Theoretical knowledge to propagate information from 
the observations available to other species and locations and 
evaluate chemical self consistency. 

• Provide an uncertainty on the analysis produced that 
accounts for observational uncertainty, temporal and spa- 
tial representativeness uncertainty, and theoretical (model) 
uncertainty, i.e. assimilation requires a specification of the 
error statistics of the observations and the underlying model. 

For example, Figure 1 below shows the time evolution 
of the NO and NO 2 analyses and the analyses uncertainty. 
Conventional modelling studies do not present the time evo- 
lution of uncertainty associated with their predictions. Very 
useful as they are, techniques such as reverse domain filling 
(RDF) isentropic trajectories, or 2D and 3D chemical mod- 
els are not able to do this. 

Assimilation should not be seen as competing with other 
modelling tools, rather its purpose is objective improve- 
ment by combining observational, theoretical, and uncer- 
tainty information formally and statistically in a mathemat- 
ical framework. 
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The key aim in developing such a powerful method is 
to look at scientific issues. However, the credentials of the 
method must first be rigorously established. The purpose 
of this paper is to evaluate critically the analyses produced 
by chemical data assimilation and evaluate its skill with ob- 
jective measures. Subsequent papers will use the method to 
look at a variety of scientific issues. For example: The role 
of halogens in the initiation and catalysis of methane oxi- 
dation in the upper troposphere lower stratosphere region. 
The time evolution and partitioning of nitrogen, chlorine, 
bromine, and hydrogen species and the relative important 
of ozone loss catalytic cycles from the start of the Upper At- 
mosphere Research Satellite (UARS) period to the present. 

2.1. Flow Tracking Coordinates 

Under adiabatic conditions air parcels move along isen- 
tropic surfaces (surfaces of constant potential temperature, 
9). So when considering tracer fields 9 is a suitable verti- 
cal coordinate. McIntyre and Palmer [1983, 1984], Hoskins 
et al. [1985], and Hoskins [1991] have shown the value of 
isentropic maps of Ertel’s potential vorticity (PV) for visu- 
alising large scale dynamical processes. PV plays a central 
role in large scale dynamics where it behaves as an approx- 
imate material tracer [ Hoskins et al , 1985]. 

As a result, PV can be used as the horizontal spatial coor- 
dinate instead of latitude and longitude [ Norton , 1994; Lary 
et al , 1995a]. PV is sufficiently monotonic in latitude on an 
isentropic surface to act as a useful replacement coordinate 
for both latitude and longitude, reducing the tracer field 
from three dimensions to two. These ideas have already led 
to interesting studies correlating PV and chemical tracers 
such as N 2 0 and O 3 [ Schoeberl et al, 1989; Proffitt et al, 
1989, 1993; Lait et al, 1990; Douglass et al, 1990; Proffitt 
et al, 1989, 1993; Atkinson , 1993]. A key result of these 
studies is that PV and ozone mixing ratios are correlated 
on isentropic surfaces in the lower stratosphere, as was first 
pointed out by Danielsen [1968]. 

Since the absolute values of PV depend strongly upon 
height and the meteorological condition, it is useful to nor- 
malise PV and use PV equivalent latitude (4> e ) as the hor- 
izontal coordinate instead of PV itself. <p e is calculated by 
considering the area enclosed within a given PV contour on 
a given 9 surface. The <j> B assigned to every point on this PV 
contour is the latitude of a latitude circle which encloses the 
same area as that PV contour. Therefore, for every level in 
the atmosphere <f> e has the same range of values, -90° to 90°. 
This provides a vortex-tracking, and indeed a flow-t racking, 
stratospheric coordinate system. 

We have taken these now well established ideas and used 
them as a framework for our chemical data assimilation. 
This approach is valid for our analysis interval of one day, 
and often for up to ten days or longer in the stratosphere. 
Because a major component of the variability of trace gases 
is due to atmospheric transport it makes sense to use a co- 
ordinate system that ‘follows’ the large scale flow pattern to 
perform our data assimilation. 

3. ATMOS Observations 

As an example of how chemical data assimilation cast in 
flow tracking co-ordinates can be used to extend the value of 
solar occultation datasets we consider the Atlas - 1 data from 
the Atmospheric Trace Molecule Spectroscopy Experiment 
(ATMOS) instrument. 

ATMOS is an infrared spectrometer (a Fourier transform 
interferometer) that was designed to study the chemical 


composition of the atmosphere. The ATMOS instrument 
has flown four times on the Space Shuttle, Spaceiab-3 in 
April 1985, and Atlas 1, 2, and 3 in March 1992, April 1993, 
and November 1994. In this study we use all the data from 
Atlas 1 for a vertical profile in our flow tracking co-ordinates 
at an equivalent PV latitude, <f> e , of 38° S. This equivalent 
PV latitude was chosen as it was where ATMOS observed 
the atmosphere’s composition over the largest range of alti- 
tudes. We wanted to study a single vertical profile so that 
the detailed diurnal information that assimilation provides 
could be highlighted. We assimilated ATMOS Atlas- 1 obser- 
vations of 14 species, namely: O 3 , NO, NO 2 , N 2 05 , HNOa, 
H0 2 N0 2 , HCN, C10N0 2 , HC1, H 2 0, CO, 0O 2 , CH 4 , and 
n 2 o. 

The relative role played by the observations of different 
species in the assimilation is an interesting issue studied in 
Lary [submittedb]. Based on the method described in Lary 
[submittedb] the top ten ranking of chemical information 
content for this vertical situation over an entire diurnal cy- 
cle is 0 3 , CO, H0 2 , N0 2 , CIO, CH 3 OO, HCHO, NO, BrO, 
and H 2 0. It is not at all surprising, in fact rather reassur- 
ing, that O 3 comes out as number one. We also see key 
representatives of carbon, nitrogen, hydrogen, and bromine 
species reminding us how coupled atmospheric chemistry is. 
It is interesting that two of the top ten or peroxy radicals, 
number three is H0 2 and number six is CH 3 OO. It is also 
noteworthy that the methyl species are rarely considered in 
stratospheric studies yet have significant information con- 
tent. 

4. Chemical Assimilation Scheme 

Data assimilation enables better use of observations of 
atmospheric chemistry. An example of the way assimilation 
can reconstruct a full diurnal cycle from occultation obser- 
vations is shown in Figure 1. This example will be discussed 
more later. 

The first use of chemical data assimilation was by Fisher 
and Lary [1995]. Khattatov et al [1999] have described the 
chemical Kalman filter used in this study. Other studies us- 
ing assimilation include Elbem [1997]; Elbem et al [1997]; 
Eskes et al [1998, 1999]; Khattatov et al [1999]; Lary [1999]; 
Elbem et al [2000]; Khattatov et al [2000, 2001]; Wang et al 
[2001]; Smyshlyaev and Geller [2001]; Clerbaux et al [2001]; 
Errera and Fonteyn [2002]; Chipperfield et al [2002]; Daescu 
and Carmichael [2003]. The photochemical mode used is the 
extensively validated AutoChem model [Fisher and Lary, 
1995; Lary et al, 1995b; Lary, 1996]. The model is explicit 
and uses an adaptive-timestep, error monitoring time inte- 
gration scheme for stiff systems of equations [ Press et al, 
1992; Stoer and Bviirsch, 1980]. AutoChem was the first 
model to perform 4D variational data assimilation (4 D-Var) 
Fisher and Lary [1995] and now also includes a Kalman fil- 
ter Khattatov et al [1999]. AutoChem uses kinetic data 
largely based on Atkinson et al [1997]; DeMore et al [1997] 
with updates for nitrogen chemistry based on Brown et al 
[1999a]; Portmann et al [1999]; Brown et al [1999b]. 

A key part of the chemical model is the calculation of 
photolysis rates. In this study they are calculated using 
full spherical geometry and multiple scattering [Anderson, 
1983; Lary and Pyle, 1991a, b; Meier et al, 1982; Nicolet 
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Figure 1 . Diurnal cycles of NO, and NO 2 vertical profiles for the 30 March 1992 assimilation of ATMOS 
Atlas- 1 data at 38°S. The left hand panel show the chemical analyses produced by assimilation overlaid 
with the observations (color filled circles). The right hand panel show the analyses uncertainties overlaid 
with the observation uncertainties (color filled circles). The reduction in analyses uncertainty can be seen 
in the right hand plots when new observational information comes in. The dots indicate the locations of 
the assimilation analyses grid. A single vertical profile was used so that the detailed diurnal information 
that assimilation utilizes could be highlighted. 


et al . , 1982] corrected after Becker et al [2000]. The pho- 
tolysis rate used for each time step is obtained by ten point 
Gaussian- Legendre integration [Press et al., 1992]. These 
calculations are updated on every assimilation iteration to 
ensure that the improved ozone profile at a given equivalent 
latitude is used to calculate the photolysis rates. 

The chemical system under study here contains a total of 
60 species. 55 species are time integrated, namely: 0( l D), 
0( 3 P), O3, N, NO, NO2, NO3, N2O5, HONO, HNO3, 
HO2NO2, CN, NCO, HCN, Cl, Cl 2 , CIO, ClOO, OCIO, 
CI2O2, CINO2, CIONO, CIONO2, HC1, HOC1, CH3OCI, 
Br, Br 2 , BrO, BrON0 2 , BrONO, HBr, HOBr, BrCl, 
H 2 , H, OH, H0 2 , H 2 0 2 , CH3, CH3O, CH3O2, CH3OH, 
CH3OOH, CH3ONO2. CH3O2NO2, HCO, HCHO, CH 4 . 
CHsBr, CF2CI2, CO, N2O, CO2, H2O. The remaining 5 
species are not integrated and are not in photochemical equi- 
librium, namely: O2, N2, HCl(s>, H20(s), HN03(s>. The 
model contains a total of 420 reactions, 278 bimolecular re- 
actions, 32 trimolecular reactions, 60 photolysis reactions, 4 
cosmic ray processes, and 46 heterogeneous reactions. 

The first guess used came from a January simulation 
of the Cambridge 2D chemical model [ Law and Pyle , 
1991, 1993a, b|. As these are not particularly well suited to 


the conditions observed by ATMOS during March 1992 our 
chemical assimilation was cycled three times. The results 
presented are from the third cycle. This allowed for quite 
rapid convergence to the conditions observed by ATMOS. 
E]ach cycle involves a first guess simulation, followed by two 
iterations of the chemical Kalman filter, and then an analysis 
simulation. The analysis simulation is a free running chemi- 
cal simulation starting from the chemical state vector at the 
end of the chemical Kalman filter. This is done to mimic the 
way 4D-variational data assimilation works with the added 
advantage of having the time evolution of the full species 
error co-variance matrix. This is desirable as it means there 
are no steps in the species concentrations with time due to 
Kalman filter increments (i.e. no time discontinuities when 
new information is used). 

4.1. Observations 

The key difference between conventional modelling and 
data assimilation is the use of observations and information 
on observational and other uncertainties. The uncertain- 
ties we consider are the observational uncertainty quoted 
by the observation team, the representativeness uncertainty 
(i.e. the spatial variability over an analysis grid cell) and 
the photochemical theoretical uncertainty. 
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Figure 2. Vertical profiles for O 3 , NO, and NO 2 of analyses and observations (left panel). The mid- 
dle panel shows the skill measures described in §4.4, namely the observation increment (O-F) which 
is a measure of forecast skill, the analysis increment (A-F) which is a measure of model bias, and the 
analysis-observations (A-O) which is a measure of the analyses bias. The right hand panel shows the 
uncertainties, observational, representativeness, and analyses. All panels are from the 30 March 1992 
assimilation of ATMOS Atlas- 1 data at 38° S. Observations and observation error bars in blue. 
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We obtain information on the representativeness uncer- 
tainty and improve the signal to noise by using all observa- 
tions available within a grid cell and studying the probabil- 
ity distribution function (PDF) of these observations. The 
width of the PDF is used as a measure of the variability or 
representativeness uncertainty for the grid cell. When more 
than one observation is available in a grid cell we take the 
median of the PDF as the observation for that cell, and the 
median of the observation uncertainty PDF as the observa- 
tion uncertainty for that grid cell. So both the observation 
and observation uncertainty are obtained directly from the 
data in a way that they are truly representative of that grid 
cell. 

The Kalman filter assimilation is a multivariate technique 
which can simultaneously process observation /uncertainty 
information pairs of the 55 time integrated species, when- 
ever and wherever they are available, together with the the- 
oretical information from our photochemical model. 

4.1.1. Temporal Search Criteria. 

The criteria used to determine at what time we use an 
observation are local solar time and solar zenith angle. In 
the case of short lived species, or those with a significant 
diurnal cycle, such as NO and NO 2 , the observation solar 
zenith angle needs to be within 2° of the analyses time step 
solar zenith angle, and also the observation local solar time 
needs to be within one hour of the analyses local solar time, 
for the observation to be used. This ensures that sunrise 
measurements are used at sunrise, and that sunset measure- 
ments are used at sunset. This is also important as there 
can be a significant difference between the geographic lati- 
tude and the equivalent PV latitude. As our analysis grid 
uses equivalent PV latitude it is important that we use the 
local solar zenith angle as the criteria for using an observa- 
tion, and not just the local solar time. In the case of longer 
lived species, or those without a significant diurnal cycle, we 
can use less stringent criteria. For these longer lived species 
we specify that the observation solar zenith angle needs to 
be within 6° of the analyses time step solar zenith angle, 
and also the observation local solar time needs to be within 
one hour of the analyses local solar time. These in fact are 
rather strict criteria for the long lived species. Sensitivity 
experiments have shown that much more lax values work 
just as well for the long lived species. 

4.1.2. Spatial Search Criteria. 

The criteria used to determine at what location we use 
an observation are equivalent PV latitude and poten- 
tial temperature (0). An observation is used in (j> e -0 grid 
box where it lies. The grid has 21 potential temperature 
levels spaced equally in log(0) between 400 K and 2000 K, 
and 32 equivalent PV latitudes spaced evenly between -90° 
and 90°. That means that no interaction between the boxes 
(meridional transport, mixing, diabatic ascent/descent) is 
considered. 

We have chosen to deal with a vertical profile of observa- 
tions at a time as vertical profiles can contain gaps. These 
are easy to fill in by eye, but for an algorithm to deal with 
the data voids we need to consider an entire profile at a time 
in our flow tracking co-ordinate space. In a full 3D assimila- 
tion the 3D spatial co- variance matrix would be performing 
this task. However, in this study we are using an array of 
0D boxes with a full Kalman filter and detailed chemistry. 
To make this computationally achievable we use multiple 
0D box models which are stacked into a series of profiles. 


Which gives us a 2D global assimilation with 21 potential 
temperature levels spaced equally in log(0) between 400 K 
and 2000 K, and 32 equivalent PV latitudes spaced evenly 
between -90° and 90°. 

So to clarify, the actual assimilation is done in a 2D array 
of completely independent boxes. However, the observation 
reconstructions used for this independent array of boxes are 
generated considering an entire profile at a time, where the 
profile is a function of potential temperature for a given 
equivalent PV latitude range. The equivalent PV latitude 
range is the width of the grid box. This is not ideal, but is a 
necessary and good approximation because of the computer 
time required to do a full 3D assimilation with a Kalman 
filter. 

Our observation reconstruction algorithm consists of the 
following steps: 

1. For each level in the analyses profile we determine 
which observations lie between the bottom of the level below 
and the top of the level above. 

2. If there are more than n thresh (usually taken to be 25) 
observations we sort them in order of their percentage un- 
certainty and just use the n thresh most reliable data points 
(i.e. those with the lowest concentration uncertainty to con- 
centration ratio). 

3. For these data points we calculate the median concen- 
tration. 

4. If there are any levels which have no observations 
bracketed above and below by levels which did have ob- 
servations we linearly interpolate using the available values 
above and below. 

4.2. Observation Uncertainties 

The uncertainty of the reconstructed observation has two 
components. First, for each (<p e ,0) grid box we have a dis- 
tribution of observed concentrations, and a distribution of 
observed concentration uncertainties. We take the observa- 
tional uncertainty to be the median observed concentration 
uncertainty for the current distribution in the given ((j> e ,0) 
grid box. Second, the representativeness uncertainty. Infor- 
mation on the representativeness uncertainty is gained by 
using all observations available within a grid cell and study- 
ing the probability distribution function (PDF) of these ob- 
servations. The width of the PDF is used as a measure of the 
variability or representativeness uncertainty for the (<f>e,0) 
grid cell. So both the observation and observation uncer- 
tainty used are directly from the data but selected in a way 
that we are sure they are truly representative of that grid 
cell. The average deviation, or mean absolute deviation, is 
a robust estimator of the width of the distribution [Press 
et al . , 1992]. 

1 N 

Orep = ADev(x 1 ...Xn) = jy -xl (1) 

i=i 

Where xj refers to the individual concentration observations 
available in the grid cell. The reconstructed observation for 
the (< p e ,0 ) grid box is the median of the 25 most accurate 
observations available in the grid box. The median is used so 
that the reconstructed observations, and hence the analyses, 
are not affected by outliers. 

Lary [submitteda] show that the representativeness un- 
certainty is an interesting quantity as it clearly picks out 
mixing barriers such as the polar vortices, the tropical pipe, 
and the tropopause. In addition, for species such as ozone 
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in polar day we see a large representativeness uncertainty at 
the summer pole. It is important to note that this represen- 
tativeness uncertainty is fully accounted for and goes in to 
the analyses and is reflected in the analyses uncertainty. 

Whatever co-ordinate system is used such situations will 
be encountered, it is therefore important to account for the 
grid representativeness uncertainty as accurately as possible 
and include them in the analyses. 

It is also fair to observe that this level of treatment of 
uncertainties is seldom, if ever, performed for atmospheric 
chemical modeling. It is suggested that this is a major step 
taken in this study. 

4.3. Photochemical Theory Uncertainties 

Currently we assume a growth rate in the photochemical 
theory uncertainty of 5% per time step (i.e. the diagonal of 
the co-variance matrix is increased by 5% at each time step 
to crudely account for photochemical theory uncertainties). 
The time step used here is 15 minutes, but we often use a 
one hour time step also. This is deliberately conservative to 
avoid filter divergence. In future we want to explicitly cal- 
culate the photochemical theory uncertainty by using the 
uncertainty in each rate coefficient and the constituent con- 
centrations. 

4.4. Analyses Quality Measures 

Once the analyses has been performed we quantify their 
quality by generating a set of statistics that compare the 
observations used in making the analyses with the analyses 
itself. 

1. Observation Increment, (O-F). This typically is 
the best measure of forecast skill. The difference between 
the first guess and the observations, also known as observed- 
minus- background differences or the innovation vector [ Da- 
ley , 1991). 

(O-F) = if >*-/*) (2) 

k= 1 

where Ok denotes the kth observation and fk the correspond- 
ing value from the first guess forecast. The lower the abso- 
lute value of (O-F) the better. 

2. Analysis Increment, (A-F). This is a good measure 
of model bias. The difference between the first guess and the 
final analyses, also known as analysis-minus- background dif- 
ferences or the correction vector [Daley, 1991). 

n 

(A-F) = i y>* - fk) (3) 

n L ' 

k= 1 

where ak denotes the analyses and fk the corresponding 
value from the first guess forecast. The lower the absolute 
value of (A-F) the better. 

3. Analysis-Observations (A-O). This typically is the 
best measure of the analyses bias. The difference between 
the analyses and the observations, also known as analysis- 
minus-observed differences or the innovation vector [Daley, 
1991). 

n 

(A-O) = iy >*-<*) (4) 

n z ' 

fc=i 

where Ok denotes the kth observation and ak the correspond- 
ing value from the analyses. The lower the absolute value 
of (A-O) the better. 


5. Assessment of the Assimilation 

ATMOS Atlas-1 observed 14 of the 55 species our assim- 
ilation scheme can simultaneously analyze. These 14 ob- 
served species include key species in reactive nitrogen and 
chlorine chemistry as well as several long lived source gases. 
The Atlas- 1 observations are only available at satellite sun- 
rise and sunset. In this case study for 30 March 1992 the 
measurements are mainly available at sunset. 

5.1. 0 3 

Figure 2 (a) displays vertical profiles for the 30 March 
1992 assimilation (ATMOS Atlas- 1) of the O3 analyses con- 
centrations, uncertainties, and skill measures. The left hand 
panel shows that there is excellent agreement between the 
ATMOS observations (diamonds with error bars) and the 
analyses (line with error bars). 

The total observational uncertainty has two components, 
the measurement uncertainty (solid line in the right hand 
panel) and the representativeness uncertainty (dashed line). 
The representativeness uncertainty attempts to quantify the 
spatial variability of ozone over the grid box. This has been 
quantified by using the average deviation of observed in each 
grid box. The analyses uncertainty is the diagonal element 
of the time integrated Kalman filter error co- variance matrix 
(solid line with overlaid circles). It can be seen that these 
uncertainties are of the order of 0.1 ppmv, or a few percent. 

The analyses bias, (O-A), is small and well within the 
observational uncertainties. The Forecast skill, (O-F), and 
model bias (A-F), are a little larger. This is because the 
apriori first guess was not entirely suitable for the situation 
ATMOS observed. As was mentioned in the introduction it 
came from a 2D model of the atmosphere. 

5.2. NO and N0 2 

Full diurnal cycles produced by the assimilation for NO 
and NO2 from the occupation measurements are shown in 
Figure 1. The left hand plots show the analyses overlaid 
with the observations (color filled triangles). The right hand 
plots show the analyses uncertainties overlaid with the ob- 
servation uncertainties (color filled triangles). The reduction 
in analyses uncertainty can be seen in the right hand plots 
when new observational information comes in. The dots 
indicate the locations of the assimilation analyses grid. 

Figure 2 (b) and (c) display vertical slices through the NO 
and NO2 diurnal cycles at sunset. The left hand panels of 
Figure 2 (b) and (c) shows that there is good agreement be- 
tween the ATMOS observations (diamonds with error bars) 
and the analyses (line with error bars). 

5.3. N 2 O 5 , HNO 3 , and HO 2 NO 2 

Figure 3 (a) to (c) display vertical slices through the 
N2O5, HNO3, and HO2NO2 diurnal cycles at sunset. For 
each of these species the analyses and observations agree to 
within the analyses uncertainty. 

5.4. Other Species 

Figure 4 displays vertical slices through the C 10 N 0 2 , 
HC 1 , HCN, NO2, CH 4 , H 2 0 , CO and C 0 2 diurnal cycles 
at sunset. For each of these species the analyses and ob- 
servations agree to within the analyses uncertainty. It is 
noteworthy that in some cases (for example CIONO2) the 
assimilation gives a larger uncertainty than was specified by 
the measurement team. This occurs for at least two reasons: 
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Figure 3. Vertical profiles for N 2 0 5 , HN0 3 , and HO 2 NO 2 of analyses and observations (left panel). 
The middle panel shows the skill measures described in §4.4, namely the observation increment (0-F) 
which is a measure of forecast skill, the analysis increment (A-F) which is a measure of model bias, and 
the analysis-observations (A-O) which is a measure of the analyses bias. The right hand panel shows the 
uncertainties, observational, representativeness, and analyses. All panels are from the 30 March 1992 
assimilation of ATMOS Atlas- 1 data at 38°S. Observations and observation error bars in blue. 
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Figure 4. Vertical profiles for CIONO 2 , HCI, HCN, NO 2 , CH 4 , H 2 O, CO and C0 2 analyses from the 
30 March 1992 assimilation of ATMOS Atlas- 1 data at 38°S. Observations and observation error bars in 
blue. 
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1. The assimilation also accounts for the representative- 
ness uncertainty. In some cases where there are large spatial 
gradients this uncertainty can be larger, even considerably 
larger, than the observational uncertainty. The analyses un- 
certainty reflects this as well as the photochemical model 
uncertainty. 

2. Information on this species is propagating through the 
assimilation system from observations of other chemically 
coupled species. This other information with its associated 
weight (uncertainty) is not entirely consistent with the obser- 
vations and so the analyses acquires a higher uncertainty, for 
example due to the presence of observation or model biases. 

6. Data-Theory Intercomparison 

A key advantage of using data assimilation is the prop- 
agation of information from data-rich regions to data- poor 
regions, which is very relevant to the assimilation of solar 
occultation data such as from ATMOS. 

6.1. Mathematical Framework For Intercomparison 

Previous studies, such as Chang et al [1996], have shown 
good agreement of volume mixing ratio profiles measured 
by ATMOS with in situ measurements acquired from plat- 
forms on the NASA ER-2. Sen et al [1998] presented volume 
mixing ratio profiles of NO, NO 2 , HONO, HO 2 NO 2 , N 2 O 5 , 
and CIONO 2 and their composite budget (NO y ), from 20 
to 39 km, measured remotely in solar occultation by the 
Jet Propulsion Laboratory MkIV Interferometer during a 
balloon flight from Fort Sumner, New Mexico (35°N), on 
September 25, 1993. In general, the observed profiles agree 
well with values calculated using a photochemical steady 
state model constrained by simultaneous MkIV observations 
of long-lived precursors and aerosol surface area from the 
Stratospheric Aerosol and Gas Experiment II. 

What is new in this study is that the comparison between 
observations and photochemical theory, and the constraint 
of photochemical theory by observations is done within an 
objective mathematical framework where full account is 
taken of information uncertainty both in the observations 
and photochemical theory. Previous ATMOS intercompar- 
isons have not done this. 

After examining the figures in this paper it may be asked, 
‘How much information is ATMOS providing, and how is 
this any different from what we would obtain from using 
a free running model?’ ATMOS has provided a great deal 
of information for the figures shown here. If it was not for 
the constraint of ATMOS Atlas- 1 observations the sunset 
observations would not have agreed with the photochemi- 
cal model, and not simultaneously agreed with the O3, NO, 
N0 2 , N 2 0 5 , HNO 3 , HO 2 NO 2 , HCN, CIONO 2 , HC1, H 2 0, 
CO, CO 2 , CH 4 , and N 2 0. 

6.2. Quantifying Uncertainty 

If the primary interest is the likely magnitude of diurnal 
variation then one does NOT need assimilation. The rea- 
son for using the assimilation is to simultaneously use as 
much information as we have and to provide an estimate of 
the uncertainty associated with this information which in- 
cludes a treatment of observational uncertainty, spatial and 
temporal representativeness uncertainty, and photochemical 
theory uncertainty. Assimilation also allows objective self- 
consistency tests such as those described in §4.4, and the x 2 
test of (O-F) proposed by Dee [1995], applied by Menard 


et al [2000]; Menard and Chang [2000], and Khattatov et al. 
[2000]. The currently available techniques other than assim- 
ilation do not provide this capability. 

6.3. Inferring Information on Unobserved Quantities 

In the case of a quality dataset, such as ATMOS, the as- 
similation can reliably produce full diurnal cycles from just 
occultation observations, and relatively reliably infer anal- 
yses for unobserved species. In this case study full diur- 
nal analyses and uncertainties of 55 species were produced 
from observations of just 14 of these species. In addition 
the assimilation has provided a useful check for photochem- 
ical self-consistency between the ATMOS observations, and 
between the observations and photochemical theory. Self- 
consistency is not always present between the multi con- 
stituent observations made from a given platform, for ex- 
ample UARS observed N0 2 from three instruments which 
were not self-consistent with each other. This will be the 
subject of further papers. 

The reason for choosing 38° S for this case study is that 
at this latitude there was the greatest vertical extent of AT- 
MOS Atlas- 1 observations. The good agreement obtained 
between ATMOS observations and photochemical theory re- 
ported here was not the result of a fortuitous choice of data. 
Many other studies have been performed with data from nu- 
merous platforms including other ATMOS missions, MkIV 
balloon flights, ER-2 data, and UARS data. Space does not 
allow to present these here, they will be the subject of other 
papers. 

Observations from other platforms and measurement 
campaigns have been considered, such as the Strato- 
spheric Photochemistry Aerosol and Dynamics Experiment 
(SPADE) in May 1993 and the Cryogenic Limb Array Etalon 
Spectrometer (CLAES), which have much more diurnal cov- 
erage than just sunrise and sunset and show the same result. 
Chemical data assimilation can reconstruct well an entire di- 
urnal cycle from reliable observations at one time. Whether 
the observations are at sunrise, sunset or spaced throughout 
the day. 

This means that multispecie chemical data assimilation is 
likely to be useful for data product monitoring and quality 
assessment. This is relevant to the use of data assimila- 
tion within a validation campaign of new instruments such 
as those to be placed on board EOS AURA. Solar occulta- 
tion data (such as from the Canadian Atmospheric Chem- 
istry Experiment, ACE, and HALOE), aircraft data (such 
as from the DC-8 and ER-2) and balloon data (such as the 
MkIV) could be used to reconstruct full-diurnal cycles of 
species such as NO and N0 2l thus relaxing the local solar 
time/zenith angle coincidence requirements needed to vali- 
date AURA observations. 

7. Summary 

Multispecie chemical data assimilation is useful for con- 
stituent data product monitoring/validation and quality as- 
sessment. Two key advantages of using data assimilation is 
the ability to check the chemical self-consistency of multi- 
specie observations and information prorogating from data- 
rich regions to data-poor regions. This is particularly rel- 
evant to the assimilation of solar occultation data such as 
from ATMOS. 
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Chemical data assimilation has been able to take solar 
oeeultation measurements of 14 species at sunrise and sun- 
set and produce full diurnal cycles for 55 observed and un- 
observed species together with their associated uncertainty. 
The associated uncertainty includes a treatment of the ob- 
servation, representativeness, and photochemical theory un- 
certainty. 

We confirm that there is self-consistency between the AT- 
MOS At las- 1 observations of O 3 , NO, N0 2j N 2 O 5 , HNO 3 , 
HO 2 NO 2 , HCN, C10N0 2? HC1, HaO, CO, C0 2 , CH 4 , and 
N 2 0, and between ATMOS and photochemical theory If 
there had been inconsistency between the observations and 
photochemical theory the assimilation skill scores such as 
(O-F), and (O-A) would have highlighted these and quanti- 
fied their magnitude. 

Future studies in preparation using chemical data assim- 
ilation include those to look at: The halogen catalyzed 
oxidation of hydrocarbons in the upper troposphere and 
lower stratosphere. The partitioning of hydrogen, nitro- 
gen, chlorine, bromine, and carbon species as a function 
of temperature, solar illumination and aerosol loading. The 
relative roles of catalytic ozone loss cycles and how they 
have changed with time and location over the UARS period. 
Satellite instrument validation including Envisat, ACE, and 
AURA. 
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